High Quality Thermostat Control by Reinforcement Learning: A Case Study

Author

Martin Riedmiller

Abstract

Martin Riedmiller, Institut für Logik, Komplexität und Deduktionssysteme, Universität Karlsruhe, D-76128 Karlsruhe, Germany. E-mail: [email protected]

Temperature control is an important issue in many manufacturing processes. The requirements for high precision and fast reaction to disturbances, together with time delays of varying length caused by considerably changing characteristics of the respective production process, make it a challenging application field for the improvement and development of reinforcement learning techniques. The article shows some first results on the application of a neural reinforcement learning controller to a thermostat control problem. Open problems are discussed and some ideas for further research directions are presented.

To appear in: Proceedings of CONALD '98, CMU, Pittsburgh.

I. A Thermostat Controller

In many manufacturing applications it is important to keep a liquid (water, oil, chemical substance) at a certain temperature. One reason is that a chemical reaction may only have the desired outcome if the temperature is kept within (very) tight bounds. This is the case, for example, in wafer production processes, but many more industrial applications exist. They vary considerably with respect to the quality and the amount of the liquids used, resulting in a broad range of different process characteristics. This variety makes it very difficult and costly to design a controller that shows good control characteristics in every application situation. Reinforcement learning seems to be a promising approach to overcome this problem by learning to adapt the control law to varying scenarios.

A. System description

The following hardware structure is a common apparatus for liquid temperature control with a very broad application range (figure 1). A heating device directly heats a liquid within a smaller internal tank (about 1 liter). This liquid is then pumped through a tube that runs through a larger external tank (typically 10 to 60 liters), thereby emitting energy and heating the liquid in the external tank. The temperature of the liquid in the external tank can thus be controlled by first heating the internal liquid.

Fig. 1. Typical hardware structure to control the liquid temperature in the external tank (right). Labels in the figure: T_heat, T_int, T_ext, Pump, Power.

The temperature of the external liquid depends on many parameters: the quality of the internal and the external liquid, the amount of internal liquid that is pumped through the tube per minute, the size of the internal and the external tank, the environment temperature, external disturbances, the quality of the tube, and so on.
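The indirect heating path described above can be illustrated with a small simulation. The following is a minimal sketch under stated assumptions, not the model used in the paper: it treats each tank as a single well-mixed volume of water, and the class name `TwoTankModel`, the function `simulate`, and all coefficients (tube and loss conductances, tank sizes, environment temperature) are hypothetical values chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass
class TwoTankModel:
    """Hypothetical lumped model of the internal/external tank apparatus.

    All coefficients are illustrative assumptions, not values from the paper.
    """
    c_int: float = 4186.0 * 1.0    # heat capacity of about 1 liter of water, J/K
    c_ext: float = 4186.0 * 30.0   # heat capacity of about 30 liters of water, J/K
    k_tube: float = 50.0           # tube conductance internal -> external, W/K (assumed)
    k_env: float = 20.0            # loss from external tank to environment, W/K (assumed)
    t_env: float = 20.0            # environment temperature, deg C (assumed)

    def step(self, t_int, t_ext, power, dt=1.0):
        """Advance both temperatures by dt seconds using explicit Euler."""
        q_tube = self.k_tube * (t_int - t_ext)      # heat flow through the tube, W
        q_loss = self.k_env * (t_ext - self.t_env)  # heat lost to the environment, W
        t_int_next = t_int + dt * (power - q_tube) / self.c_int
        t_ext_next = t_ext + dt * (q_tube - q_loss) / self.c_ext
        return t_int_next, t_ext_next


def simulate(model, power, steps, t_int=20.0, t_ext=20.0):
    """Apply a constant heating power (W) for a number of one-second steps."""
    for _ in range(steps):
        t_int, t_ext = model.step(t_int, t_ext, power)
    return t_int, t_ext


if __name__ == "__main__":
    t_int, t_ext = simulate(TwoTankModel(), power=500.0, steps=600)
    print(f"after 10 min: T_int={t_int:.2f} C, T_ext={t_ext:.2f} C")
```

Even in this simplified form, the external temperature responds to the heating power only through the internal tank, so it lags behind the control input. Changing `c_ext`, `k_tube`, or `t_env` changes that lag, which gives a rough picture of why one fixed control law is hard to tune for every setup.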


Similar resources

Autonomous HVAC Control, A Reinforcement Learning Approach

Recent high profile developments of autonomous learning thermostats by companies such as Nest Labs and Honeywell have brought to the fore the possibility of ever greater numbers of intelligent devices permeating our homes and working environments into the future. However, the specific learning approaches and methodologies utilised by these devices have never been made public. In fact little inf...

The Study of Thermostat Impact on Energy Consumption in a Residential Building by Using TRNSYS

The present study investigates the effectiveness of thermostat control strategy in cooling energy consumption in residential buildings. To evaluate the energy consumption, two scenarios including a residential zone with and without the thermostat control system are assumed. The TRNSYS software provides an efficient numerical tool to model and evaluate a cooling system. Furthermore, since solar-...

Reinforcement Learning Based PID Control of Wind Energy Conversion Systems

In this paper an adaptive PID controller for Wind Energy Conversion Systems (WECS) has been developed. The adaptation technique applied to this controller is based on Reinforcement Learning (RL) theory. Nonlinear characteristics of wind variations as plant input, wind turbine structure and generator operational behavior demand for high quality adaptive controller to ensure both robust stability an...

A learning agent for heat-pump thermostat control

Heating, Ventilation and Air Conditioning (HVAC) systems are one of the biggest energy consumers around the world. With the efforts of moving to sustainable energy consumption, heat-pump based HVAC systems have gained popularity due to their high efficiency and due to the fact that they are powered by electricity rather than by gas or oil. One drawback of heat-pump systems is that their efficie...

Reinforcement learning based feedback control of tumor growth by limiting maximum chemo-drug dose using fuzzy logic

In this paper, a model-free reinforcement learning-based controller is designed to extract a treatment protocol because the design of a model-based controller is complex due to the highly nonlinear dynamics of cancer. The Q-learning algorithm is used to develop an optimal controller for cancer chemotherapy drug dosing. In the Q-learning algorithm, each entry of the Q-table is updated using data...

Publication date: 1998
